Ref 

# 



Hits 



Search Query 



DBs 



Default 
Operator 



Plurals 



Time Stamp 



L2 



:L3 



L4 



L5 



L6 



L7 



fGpherenGyiadj ipointi: 



252 



probe adj command 



761: 



;l ; ;arid;2;; 



65 



3 and miss and hit 



23 



4 and exclusive 



5 and directory 



6 arid node 



.US-PGPUB; 
USPAt; ; 
EPO;;jpb; ; 
DErWeNT; 
IBMjTDB: : 



OR: 



US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM_TDB 

lUS-PGPUB; 
USPAT; i 
EPO; JPO; 
DERWENT; 
IBMITDB 

US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM_TDB 

US-PGPUB; 
USPAT; ; : 
EPO; JPO; 
DERWENT; 
IBMITDB 

US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM TDB 



OR 



OR;; 



OR 



OR 



OR 



US-PGPUB; 
USPAt;: ! 
EPO; JPO; 
bERWENT; 
IBM TDB 



OR- 



ON 



ON 



ON; 



ON 



ON 



ON 



ON 



2005/07/19 20:00 



2005/07/19 20:01 



2005/b7/i?;20:0i 



2005/07/19 20:01 



2005/07/19 20:02 



2005/07/19 20:02 



2005/07/19 20:06 



i 



Search History 7/19/05 8:07:43 PM Page 1 



IEEE Xplore# Search Result 



Page 1 of 1 




^Search Results 



Ho:r:o : Login j Logout | Access Information j A&rts | 
Welcome United States Patent and Trademark Office 

BROWSE SEARCH l££E XPLORE GUIDE 



Results for "((coherency<and>(probe<near/4>command)<and> miss<and>hit<and>exclusive..." 
Your search matched 0 of 1194402 documents. 

A maximum of 100 results are displayed, 25 to a page, sorted by Relevance in Descending order. 



» View Session History 
a New Search 

» Key 

IE££ JNL IEEE Journal or 
Magazine 

IEE JNL IEE Journal or 
Magazine 



Modify Search 



((coherency<and>(probe<near/4>command)<and> miss<and>hit<and>exclusive<ani j 



iEEE 
CNF 



IEEE Conference 
Proceeding 



SEE CNF IEE Conference 
Proceeding 



IEEE 
STD 



IEEE Standard 



Check to search only within this results set 
Qisp&y Format: Citation C Citation & Abstract 



No results were found. 

Please edit your search criteria and try again. Refer to the Help pages if you need assistance revisir 



Help Contact Us Privacy & : 

Khaatf <3 Copvrighi 2001$ i E £&' .» 



http://ieeexplore,ieee.org/search/searchresult jsp?queryl=&scopel=metadata&opl=and&que... 7/19/05 



Results (page 1): coherency (probe command) miss hit exclusive directory node 



Page 1 of 6 



HI 



USPTO 



Subscribe (Full Service) Register (Limited Service, Free) Login 

Search: ® The ACM Digital Library O The Guide 
jcc i h e rency ( |p ro becomm a n d i rectory node; 



i Feedback Report a problem Satisfaction 
survey 



Terms used 

coherency probe command miss hit exclusive directory node 



by rtr6SUltS ! releVanC:e .. 
Display 
results 



^ Save results to a Binder 

m 



> — ~ ,■ — 1 Search Tips 

l»ndrt form. J*j □ Open results in a new 
window 



Found 17,989 of 157,873 

Try an AdyaacMJ.earch 

Try this search in The ACM Guide 



Result page: 12345678910 



next 

Relevance scale UQBii 



Results 1 - 20 of 200 
Best 200 shown 

1 Delayed consistency and its effects on the miss rate of parallel programs 
Michel Dubois, Jin Chin Wang, Luiz A. Barroso, Kangwoo Lee, Yung-Syau Chen 
August 1991 Proceedings of the 1991 ACM/IEEE conference on Supercomputing 

Full text available: , |flpdf(1_ : 01 .[ylBJ Additional Information: Ml citation, referenc.es, cjtirvgs, index terms 



2 Query evaiuation techniques for large databases 
Goetz Graefe 

June 1993 ACM Computing Surveys (CSUR), volume 25 issue 2 

Additional Information: MLcjtation, abstract refejejices, citings, index 
terns, review 



Full text available: 1 



odft9.37 MB j 
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provide acceptable performance. The advent of object-oriented and extensible database 
systems will not solve this problem. On the contrary, modern data models exacerbate the 
problem: In order to manipulate large sets of complex objects as efficiently as today's 
database systems manipulate simple records, query-processi ... 

Keywords: complex query evaluation plans, dynamic query evaluation plans, extensible 
database systems, iterators, object-oriented database systems, operator model of 
parallelization, parallel algorithms, relational database systems, set-matching algorithms, 
sort-hash duality 



3 Eerforrra^ 

Henri E. Bal, Raoui Bhoedjang, Rutger Hofman, Ceriel Jacobs, Koen Langendoen, Tim Ruhl, M. 
Frans Kaashoek 

February 1998 ACM Transactions on Computer Systems (TOCS), Volume 16 issue x 

Additional Information: fujj. citation, abstract, references, citinas, index 



Full text available: fSpdfC.179.39 KBj 

^ terms, review 
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other DSMs. The article gives a quantitative analysis of Orca's coherence protocol (based on 
write-updates with function shipping), the totally ordered group communication protocol, 
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In this paper, we present a cache coherence protocol for MIN-based multiprocessors with 
two distinct private caches: private-block caches containing information private to a 
processor, and shared-block caches containing data accessible by all processors. The 
protocol utilizes a coherence control bus (snooping) for connecting all shared-block cache 
controllers. Timing problems due to variable transit delays through the MIN are dealt with 
by introducing Transient states in this protocol. Assuming ... 
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One potentially attractive way to build large-scale shared-memory machines is to use 
small-scale to medium-scale shared-memory machines as clusters that are interconnected 
with an off-the-shelf network. To create a shared-memory programming environment 
across the clusters, it is possible to use a virtual shared-memory software layer. Because of 
the low latency and high bandwidth of the interconnect available within each cluster, there 
are clear advantages in making the clusters as large as possi ... 

6 Design^ 
architectures 

A. Goto, A. Matsumoto, E. Tick 

April 1989 ACM SIGARCH Computer Architecture News , Proceedings of the 16th 

annual international symposium on Computer architecture, volume 17 issue 3 
Full text available: fS pdffl 1 7 MB i Additional Information: Ml citation, abstract, references, citings, index 
' ^ *'""'"'* terms 

This paper describes the design and performance of a tightly-coupled shared-memory 
coherent cache optimized for the execution of parallel logic programming architectures. The 
cache utilizes a copy-back write-allocation protocol having five states and a hardware lock 
mechanism. Optimizations for logic programming are introduced in four software-controlled 
memory access commands: direct-write, exclusive-read, read-purge, and read-invalidate. 
In this paper we describe these operations and pres ... 
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The problem of building a scalable shared memory multiprocessor can be reduced to that of 
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building a scalable memory hierarchy, assuming inter processor communication is handled 
by the memory system. In this paper, we describe the VMP-MC design, a distributed parallel 
multi-computer based on the VMP multiprocessor design, that is intended to provide a set 
of building blocks for configuring machines from one to several thousand processors. VMP- 
MC uses a memory hierarchy based on shared caches ... 
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"STiNG" is a Cache Coherent Non-Uniform Memory Access (CC-NUMA) Multiprocessor 
designed and built by Sequent Computer Systems, Inc. It combines four processor 
Symmetric Multi-processor (SMP) nodes (called Quads), using a Scalable Coherent Interface 
(SCI) based coherent interconnect. The Quads are based on the Intel P6 processor and the 
external bus it defines. In addition to 4 P6 processors, each Quad may contain up to 4 
GBytes of system memory, 2 Peripheral Component Interface (PCI) busses for ... 
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Two possible modes of Input/Output (l/0)are "sequential" and "random-access", and there 
is an extremely strong conceptual link between I/O and communication. Sequential 
communication, typified in the I/O setting by magnetic tape, is typified in the 
communication setting by a stream, e.g., a UNIX 1 pipe. Random-access communication, 
typified in the I/O setting by a drum or disk device, is typified in the communication setting 
by shared memory. In this paper, we study and s ... 
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comparisons 
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This paper introduces dynamic self-invalidation (DSI), a new technique for reducing cache 
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coherence overhead in shared-memory multiprocessors. DSI eliminates invalidation 
messages by having a processor automatically invalidate its local copy of a cache block 
before a conflicting access by another processor. Eliminating invalidation overhead is 
particularly important under sequential consistency, where the latency of invalidating 
outstanding copies can increase a program's critical path. DSI is ap ... 
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Full text available: mpdfM. 45 MB} 
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Current high performance computer systems use complex, large superscalar CPUs that 
interface to the main memory through a hierarchy of caches and interconnect systems. 
These CPU-centric designs invest a lot of power and chip area to bridge the widening gap 
between CPU and main memory speeds. Yet, many large applications do not operate well 
on these systems and are limited by the memory subsystem performance. This paper argues 
for an integrated system approach that uses less-powerful CPUs that are ... 
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Using simulation, we examine the efficiency of several distributed, hardware-based 
solutions to the cache coherence problem in shared-bus multiprocessors. For each of the 
approaches, the associated protocol is outlined. The simulation model is described, and 
results from that model are presented. The. magnitude of the potential performance 
difference between the various approaches indicates that the choice of coherence solution is 
very important in the design of an efficient shared-bus multi ... 

16 Algorithms for scalable synchronization on shared-memory multiprocessors 
John M. Mellor-Crummey, Michael L. Scott 

February 1991 ACM Transactions on Computer Systems (TOCS), volume 9 issue 1 

Full text available- j ^sdft3.07 MB) Additional Information: MLQitation, abstract references, citings, index 
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Busy-wait techniques are heavily used for mutual exclusion and barrier synchronization in 
shared -memory parallel programs. Unfortunately, typical implementations of busy-waiting 
tend to produce large amounts of memory and interconnect contention, introducing 
performance bottlenecks that become markedly more pronounced as applications scale. We 
argue that this problem is not fundamental, and that one can in fact construct busy-wait 
synchronization algorithms that induce no memory or interc ... 
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The purpose of a distributed file system (DFS) is to allow users of physically distributed 
computers to share data and storage resources by using a common file system. A typical 
configuration for a DFS is a collection of workstations and mainframes connected by a local 
area network (LAN). A DFS is implemented as part of the operating system of each of the 
connected computers. This paper establishes a viewpoint that emphasizes the dispersed 
structure and decentralization of both data and con ... 
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June 2002 Proceedings of the 16th international conference on Supercomputing 

Full text available: ^ .pdg217.„27. KB_i Additional Information: fgJl.citatipn, abstract, references, index terms 

Active memory systems help processors overcome the memory wall when applications 
exhibit poor cache behavior. They consist of either active memory elements that perform 
data parallel computations in the memory system itself, or an active memory controller that 
supports address re-mapping techniques that improve data locality. Both active memory 
approaches create coherence problems— even on uniprocessor systems— since there are 
either additional processors operating on the data directly, or the ... 

Keywords: active memory, address re-mapping, cache coherence 
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VMP is an experimental multiprocessor being developed at Stanford University, suitable for 
high-performance workstations and server machines. Its primary novelty lies in the use of 
software management of the per-processor caches and the design decisions in the cache 
and bus that make this approach feasible. The design and some uniprocessor trace-driven 
simulations indicating its performance have been reported previously. In this paper, we 
present our initial experience with the V ... 

20 Exploiting high-ievel coherence information to optimize distributed shared state 
DeQing Chen, Chunqiang Tang, Brandon Sanders, Sandhya Dwarkadas, Michael L. Scott 
June 2003 ACM SIGPLAN Notices , Proceedings of the ninth ACM SIGPLAN symposium 

on Principles and practice of parallel programming, volume 38 issue 10 
Full text available: ^ sdf(641.96 KB) Additional Information: full citation, abstract, references, index terms 

InterWeave is a distributed middleware system that supports the sharing of strongly typed, 
pointer-rich data structures across a wide variety of hardware architectures, operating 
systems, and programming languages. As a complement to RPC/RMI, InterWeave facilitates 
the rapid development of maintainable code by allowing processes to access shared data 
using ordinary reads and writes. Internally, InterWeave employs a variety of aggressive 
optimizations to obtain significant performance improvements ... 
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